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Abstract — The exact Markov modeling analysis of erasure 
networks with finite buffers is an extremely hard problem due 
to the large number of states in the system. In such networks, 
packets are lost due to either link erasures or blocking by the 
full buffers. In this paper, we propose a novel method that 
iteratively estimates the performance parameters of the network 
and more importantly reduces the computational complexity 
compared to the exact analysis. This is the first work that 
analytically studies the effect of finite memory on the throughput 
and latency in general wired acyclic networks with erasure links. 
As a case study, a random packet routing scheme with ideal 
feedback on the links is used. The proposed framework yields a 
fairly accurate estimate of the probability distribution of buffer 
occupancies at the intermediate nodes using which we can not 
only identify the congested and starving nodes but also obtain 
analytical expressions for throughput and average delay of a 
packet in the network. The theoretical framework presented here 
can be applied to many wired networks, from Internet to more 
futuristic applications such as networks-on-chip under various 
communication and network coding scenarios. 

I. Introduction 

In networks, packets may have to be relayed through a 
series of intermediate nodes where each may receive packets 
via many other data streams as well. Hence, the packets may 
have to be stored at intermediate nodes for transmission at 
a later time. For infinte buffer case, the intermediate nodes 
need not have to drop the arriving packets. However, often 
times, buffers are limited in size. Although a large buffer 
size is usually affordable and preferred to minimize packet 
drops, large buffers have an adverse effect on the packet delay. 
Additionally, as second-order effects, using larger buffer sizes 
at intermediate nodes would have practical problems such as 
on-chip board space and increased memory-access latency. 

The problem of buffer sizing and congestion control is of 
paramount interest to router design engineers. Typical routers 
today route several tens of gigabits of data each second HI. 
Realistic studies have shown that, at times, Internet routers 
handle about ten thousand independent streams/flows of data 
packets. With a reasonable buffer size of few Gigabytes 
of data, each stream can only be allocated a few tens of 
data packets. Therefore, at times when long parallel flows 
congest a router, the effects of such a small buffer space per 
flow come to play. Though we are motivated partly by such 
concerns, our work is far from modeling realistic scenarios. 
This work modestly aims at providing a theoretical framework 
to understand the fundamental limits of single information 



flow in finite-buffer wired networks and investigate the trade- 
offs between throughput, average packet delay and buffer size. 

The problem of computing capacity and designing efficient 
coding schemes for lossy networks has been widely studied 
f2l-f4l. However, the study of capacity of networks with finite 
buffer sizes has been limited. This can be attributed solely to 
the fact that finite buffer systems are analytically harder to 
track in general. In ||5l, H, it was shown that min-cut capacity 
cannot be achieved due to the limited buffer constraint in a line 
network. Here, we wish to establish a framework to investigate 
the same for general wired networks. 

Recently, |3J outlined various coding strategies for achiev- 
ing capacity in line erasure networks. However, it only con- 
sidered infinite buffer in its study. Later, |7| considered the 
limitations posed by finite memory in a simple two-hop 
wireline network. Inspired by this work, in jS], Q authors 
investigated the information-theoretic capacity of multi-hop 
wireline networks with varying buffer constraints at each node. 
However, they only considered upper and lower bounds for 
the throughput. Further, the problem of finding the packet 
delay has been visited in the queueing theory literature on 
the behavior of open tandem queues that are analogous to line 
networks fSl, fOl. However, this view is not applicable directly 
in general network topologies. 

We consider wired acyclic erasure networks with ideal 
feedback on the links when a directed random packet routing 
scheme is used. Although the previous works provide some 
insight into the performance analysis of networks, they are 
limited to either infinite-buffer cases or a simple two-hop line 
network with limited memory. Moreover, the interplay of the 
throughput and latency is not considered in a small buffer 
regime. Our approach employs a discrete-time model to derive 
estimates for the buffer occupancy distribution at intermediate 
nodes. We then obtain analytical expressions for throughput 
and average packet delay in terms of the estimated buffer 
occupancy distributions. 

The motivation behind this work is twofold: From a prac- 
tical point of view, it analyzes the performance parameters of 
the random routing protocol in wired directed networks which 
is more of an interest to the network community. On the other 
hand, from a theoretical point of view, our work develops 
a framework that not only can be adapted to estimate the 
performance parameters of various communication schemes 
such as random linear coding but also can provide insight 
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into the interplay among the buffer size, throughput and delay 
in general. The later is impossible to be exactly analyzed 
due to the exponential growth in the number of states and 
the complexity of the equations ||5| but our work develops 
an approximation to it. Further, the work establishes a lower 
bound on the information theoretic capacity of finite buffer 
networks. 

This paper is organized as follows. First, we present a 
formal definition of the problem and the network model in 
Section |II] Next, we investigate the tools and steps for finite- 
buffer analysis in Section |III] We then obtain expressions for 
throughput and average delay in Section |IV] Finally, Section IV] 
presents our analytical results compared to the simulations. 

II. Problem Statement and Network Model 

Throughout this work, we model the network by an acyclic 
directed graph ^(V, ^), where packets can be transmitted 
over a link ~^ = (m, v) only from the node u to v. The 
system is analyzed using a discrete-time model, where each 
node can transmit at most a single packet over a link in an 
epoch. The links are assumed to be unidirectional, memoryless 
and lossy, i.e., packets transmitted on a link if = G 
are lost randomly with a probability of e-^ = Note 
that the erasures are due to the quality of links (e.g., noise, 
interference) and do not represent packet losses due to finite 
buffers. Clearly, erased packets are disregarded. Moreover, the 
packet losses on different links are assumed to be independent. 
Each node v <^V has a buffer size of my packets with each 
packet having a fixed size. Source and destination pairs are 
assumed to have sufficient memory to store any data packets. 
The unicast information theoretic throughput between a pair 
of nodes is defined to be the transmission rate of information 
packets (in packets per epoch) between them. The delay of a 
packet is also defined as the time taken from the instant when 
the source starts transmitting a packet to the instant when the 
destination receives it. Note that the source node can generate 
innovative packets during each epoch. 

Throughout this paper, node s and node d represent the 
source and destination nodes respectively. Also, for any x £ 
[0, 1], x= \~x. 

III. Understanding Finite-Buffer Analysis 
Here, we study the tools and steps that enable our frame- 
work for analyzing finite-buffer wired erasure networks. 

A. Communication Scheme and Buffer States 

Thoroughly investigated in 15] for the exact analysis of 
a finite-buffer line network, the problem of identifying the 
throughput capacity is equivalent to the problem of finding the 
buffer occupancy distribution of the intermediate nodes as a 
result of ergodicity of the corresponding Markov chain. Hence, 
in order to approach our problem properly it is necessary to 
clearly define the buffer states in a manner that first of all, the 
buffer states construct an irreducible ergodic Markov chain 
and second the steady state distribution of the buffer states 
could be helpful to obtain expressions for the performance 
parameters of the network. Thus, a proper definition for the 



buffer states cannot be proposed unless the communication 
scheme is known. For example, since Random Linear Coding 
(RLC) is used as the communication scheme in [5 1, the states 
are defined as the number of packets stored in the buffer of 
a node whose information has not yet been conveyed to the 
next-hop node. Analyzing RLC in a general network (which 
achieves the capacity of finite buffer regime) is very involved 
and is the subject of the future work. Instead, as a first step 
towards estimating the performance parameters we consider 
a directed random routing scheme for packets together with 
lossless zero-delay feedback on the links. 

To be more precise, the nodes operate using the following 
rules, one after another. 

1. At each epoch, every node u selects a random ordering 
of the outgoing edges and transmits the packets it houses 
one by one. If the packet is successfully received and 
stored at a neighbor, u deletes the packet from its buffer 
and transmits the next packet (if any) on the next edge 
in the selected order Else, it tries to transmit the same 
packet on the next (in the selected order) outgoing edge. 
This process is continued until all packets are transmitted 
or a transmission is attempted on each link. Therefore, a 
node with do outgoing links transmits at most do packets 
per epoch. 

2. After the transmission attempts are made, the node at- 
tempts to accept the arriving packets. If more packets are 
received than it can store, it selects a random subset of the 
arriving packets whose size equals the amount of space 
available and stores the selected packets. Consistent with 
step 1, appropriate acknowledgment messages are then 
sent. 

Note that under such a mode of operation, at any epoch when 
two nodes receive two packets from a particular node, the 
received packets have no common information. Equivalently, 
no replication is performed at any node and it is possible 
to define a concept of state or occupancy for each node 
individually. The buffer state of a node is simply defined to 
be the number of packets it stores. It can be seen that this 
concept of occupancy follows a Markov chain behavior and 
can be studied thus. 

B. Approximate Markov Chain for an intermediate Node 

Consider a node u G V in a network ^(V, i^) with di 
incoming and do outgoing edges and a buffer size of m„ 
as depicted in Fig. [T] Let the nodes that can send packets 
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Fig. L A Node in a general wired network. 

to u be denoted by M^{u) = {ui, . . . , }■ Similarly, 
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let the nodes to which u can send packets be denoted by 
Af^{u) = {wi, . . . ,Wd„}- We assume that the following 
assumptions hold in the network regarding the arrival and 
departure processes. 

1. For each k = 1,. . . ,di, suppose that the packets arrive 
on {vk , w) in a memoryless fashion with a rate of 
packets/epoch i.e., with inter-arrival times having a geo- 
metric distribution with mean j-. Also, the processes on 
different incoming links are statistically independent. 

2. At any instant, for every k = 1, . . . , do, a packet is sent 
on {u,Wk) it is successfully received and stored at Wk 
with a probability LUk independent of the past and future 
events on the edge. 

Note that this is hypothetical since in any realistic model 
of a network, the probability that a packet is successfully 
transmitted and stored at the next hop depends not only on 
the channel conditions, but also state of the next-hop node. 
Since the state of the next-hop node has dependence on its 
past, the probability of successful receipt can also be expected 
to have a dependence on its past. In fact this mode of node 
operation can be replaced by any other scheme that fits into 
the Markovian set-up of the assumptions above. 

At any instant, the number of packets arriving can range 
from up to di and the number of packets departing can range 
from to do. Hence, at each epoch, the state riu can change 
to any other in the set {n„ — do, + d^} H {0, ... , mu}. 
At any epoch, the probability ak with which k packets arrive 
and the probability Ck with which k packets depart are given 
by 

di di 

Aix) = J2 ^kx'' = Yl i^j + ^jx) (1) 



fc=0 
do 



i=i 

do 



E{x) = ^ 



k=0 j=l 



(2) 



The dynamics of the number of packets stored at u at the 
epoch is a Markov chain that is similar to the one depicted in 
Fig. El 




Fig. 2. The dynamics of a node u with rriu = 5 and di = do = 2. 

For all input parameters, the Markov chain can be shown to 
be aperiodic, irreducible and ergodic. Therefore, it possesses 
a unique steady-state distribution. Letting A = (Ai, . . . , A^J 
to denote the vector of arrival rates and SI = (cji, . . . , w^j. ) 
to denote the vector of departure rates, the unique steady- 
state distribution A, fi, m„) for the chain can be computed 
using a pair of probability transition matrices Te and TJ^ that 
correspond to the transitions between states that are effected 

'For notational consistency, we can extend = for k > do and a/; = 
for k > di. Also, for notational convenience, we use as a short-hand 
i?{-,A, Q,m„). 



by the departure and arrival of packets, respectively. Note that 
1? is the steady-state distribution after the arriving packets are 
processed. These transition matrices are defined as follows. 



(3) 
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Note that, the i, j*' entry in Te corresponds to the transition 
of the occupancy from i — 1 to j — 1 with the departure of 
i — i packets. Similarly, the i, entry in Ta corresponds to 
the transition from i — 1 to j — 1 with the arrival of i~j packets. 
The actual transition matrix for the Markov chain is then seen 
to be TeTa- The steady-state distribution -d of the occupancy 
just after the arriving packets are accepted and the steady- 
state distribution d'^ of the occupancy just after the packets 
have been sent but before arriving packets are accommodated 
are given by 



^TeTa = I? and ^^TaTe = 



(5) 



However, these two steady-state distributions are related by 
ijt — ^j"^ and ■& — •d'^TA- To evaluate the rate of information 
on the link [u, Wi), one must investigate the rule for packet 
departure. If at an epoch, more packets are stored than the 
number of links that allow successful transmission, then each 
link conveys a packet of information to its neighbors. However, 
if the occupancy riu at an epoch / is smaller than the number 
h of outgoing links that allow for transmission, each link can 
be assumed to equally receive ^ packets on the average - a 
consequence of the random selection of ordering for outgoing 
links. Then, the time average of the information rate on the 
edge (m, Wi) can be seen as 

/{{{n, u>j)}, A, n,m„) = ^ (n^^*:)^ 
ffc{o,...,do} fce-H 



n ^-)( E E 

fc'G-f/'^ j>l-tf| j<\H\ 



In a similar argument, we notice that some of the arriving 
packets get randomly blocked if all the arriving packets cannot 
be stored. We can evaluate the probability with which a packet 
arriving on the edge {vi , u) is blocked from 

HC{0,....di} k£H\{i} 

\H\ -m^+j^ 



U^^'){ E 



k'SH" 



'J<\H\ 



\H\ 
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C. Iterative Estimation of the Buffer Occupancies 

In this section, we discuss our iterative estimation technique 
in details based on the approximate Markov chain model 
introduced in section IIII-BI Considering that blocking will 
introduce dependence of the packet incoming/outgoing process 
over each edge on its past, in order to use the results of Sec- 
tion IIII-BI we have to make certain simplifying assumptions 
on the blocking phenomenon. We model the blocking on every 
edge it = {u, v) of the network as follows. 

• Every packet that arrives at v successfully (without get- 
ting erased) is blocked in a memoryless fashion with 
probability Quv Also, at any epoch, the blocking of 
packets on any subset of incoming edges of v is assumed 
to be independent of one another. 

Under the above assumption, the blocking process and hence 
the departure process on every link of the network is modeled 
as a memoryless process. Since each packet arriving on an 
edge it = (m, v) is blocked with a probability of Quv, a 
packet arriving on it is accepted only if both the channel 
allows the packet and the node accepts it. Therefore, the 
effective departure rate on the edge (u, v) is seen to be euvQuv 
Assuming that the node operates in the mode described in 
Section IIII-AI we can use ^ and ^ to identify both the rate 
of information flow and the blocking probabilities on every 
edge of the network. Thus, the problem reduces to finding a 
solution {Quv T <luv) v)£i ^^^^ satisfies the following system 
of non-linear equations for each (u, v) e i^. 

{Suv U = S 

9u„ 



u ^ s 



Quv - 



V = d 



Note that in the above equations represents the fraction 
of time at which packets will be delivered to v. However, the 
actual rate of information flow is equal to puv = QuvQuv 

Since the above set of equations are an approximation to 
the actual dynamics, it is not clear as to whether there even 
exists a solution to the above system. However, the proof of 
existence and uniqueness of the solution is detailed in |10| for 
the case of line networks. 

Finally, the solution to the system of equations can be 
found by identifying the limit of the sequence defined by the 
following iterative procedur^ 

1. Set 1 = 1 and for each edge {u, v) G i?, set qui, = and 
(1) _ / uj^s 



2. Compute gly \quv by usmg Qui, qui on the right- 
hand side of the above system of nonlinear equations and 
increment i by 1. 

3. If i < i + 1, perform step 2. 

-In practice, the number of iterations L which suffice to converge to the 
solution within reasonable accuracy depends on the structure of the network. 
Alternatively, one may use - + |tft('+i) _ ^t(')| < ^ for the 

convergence criteria. 



IV. Estimation of the Throughput and Average 
Packet Delay 

In this section, we exploit the results of the iterative esti- 
mation method for buffer occupancy distributions and obtain 
analytical expressions for throughput and average delay. 

Since the routing scheme is such that information is not 
replicated at any node, the estimate of the total information 
that arrives at the destination is the sum total of the information 
rate arriving on each of its incoming edges. Hence, 



C[s,d,~d)^ ^ gl^[l-ql^) = 



Bvdi 



(8) 



v£j\f+{d) 



where we let {Quv^Quv) ^'^ either the component- wise limit 
of the sequence {guv,quv}ie]N when L = oo, or {g^uv ,q^) 
when L < oo. Additionally, by the conservation of information 
flow, the above estimate can be obtained by computing the rate 
of flow of information through any cut F using the following. 



(9) 



As it is defined in Section IIII-AI the operation scheme is 
chosen to have feedback on all the links and we treat packets in 
a First-Come First-Serve (FCFS) fashion at the buffers. Also, 
the absence of directed cycles allows us to assign an order 
vi,V2, ■ ■ ■ ,Vn to all the nodes of the network in a manner that 
we have i < j for every link (vi, Vj) € ~^ . 

In order to estimate the average delay that a packet experi- 
ences in the network, one can proceed in a recursive fashion. 
The average delay that an arriving packet (at time I) at node 
u E V experiences depends on the buffer occupancy of the 
node u and its outgoing links. For example, suppose at epoch 
I (packet arrival time), node u has already k packets where 
k < niu — 1. Then, the arriving packet has to wait for the 
first k packets to leave node u before it can be transmitted. 
We define Vuik) as the average time it takes from the instant 
that node u receives a packet given that it has already stored k 
packets, until the time that the destination node receives that 
packet. We compute the average delay function 2?u(-) for all 
the intermediate nodes u G using the following proposition. 

Proposition 1: Let ruv — SuvQuv be the average packet 
transfer rate on link (u, u) G and r~ be the sum of the rates 
on aU outgoing edges (i.e., = E^gA/-- (u) ''™)- ^l^o, let 
TTu{j) ij = 0, 1, . . . , rriv — 1) be the steady state probability of 
the buffer of node v G V storing already j packets right before 
a new packet arrives and is stored in the buffer For every 
intermediate node u E V, given the average delay functions 
of all its next-hop neighbors {Vuij) for all v G Af~{u) and 
j = 0,1, . . . , rriu — 1), T>u{.) can be obtained by 



■Duik) ■■ 



k + 1 



E 

w^J\f (u) 



T'uw 



X! (10) 

3=0 



for fc = 0, 1, . . . , nriu — 1. 

Proof: Due to space limitation, we just provide a brief 
sketch of the proof. Equation ( fTOb is consisted of two terms: 
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1. The first term represents the average time it takes for a 
total of fc + 1 packets (counting our selected packet) to 
leave node u successfully. 

2. The second term relates to the average delay due to 
the rest of the network. The probability of conveying 
a packet from node u to node v can be estimated by 

An arriving packet at node v finds its buffer already 
occupied by j packets with probability Try{j). Thus, 
the packet will experience an average delay of 
from this node to the destination. Hence, the average 
packet delay from node v to the destination is equal to 



Node 3 



Node 4 



It is easy to see that TTy{j) can be calculated using 

<UL_ j = l,2,...,m„-l 

j = m,„ 



i-i?r(m„) 





(11) 



To obtain the average packet delay from the source to the 
destination, the average delay function X'u(.) is computed for 
all the nodes in the reverse ordej^ (i.e.,{vn, . . . ,V2,vi})- Then, 
the total average packet delay {Vs{0)) is computed by applying 
Proposition [1] to the source node. 

V. Simulation Results 

In this section, we present the results of actual network 
simulations in comparison with our analysis and will show that 
our framework gives accurate estimates of buffer occupancy 
distributions as well as throughput and average delay. 

We consider the network shown in Fig. |3] to compare the 
results of the simulation and inferences. In this network, all 
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Fig. 5. Performance parameters of the network for different buffer sizes. 



Fig. 3. A general wired acyclic directed network chosen for simulation). 

the edges have e = 0.5 (erasure probability) except the edges 
{(1, 2), (1, 3), (15, 17), (16, 17)} for which e ^ 0.05. All the 
intermediate nodes are assumed to have the same buffer size. 
In order to measure the exact performance parameters of 
this network, millions of packets are sent from the source 
(Node 1) to the destination (Node 17). Fig. |4] presents a 
comparison between the actual buffer occupancy distributions 
and our iterative estimates for four of the nodes in the network 
of Fig. [3] Also, Fig. |5] presents the variations of the actual 
throughput and average packet delay and our analytical results 
versus the buffer size. Note that, the throughput is presented in 
packets/epoch and average packet delay is presented in epochs. 
As it can be observed, our estimation is very close to the actual 
simulation results. 
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